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A method of estimating a motion field includes estimating a 
first motion vector component from a source average vector 
and a target average vector; displacing the search area 
according to the first motion vector component; and esti- 
mating a second motion vector component from the dis- 
placed search area. 
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BLOCK MATCHING-BASED METHOD FOR limitations in estimating large displacements because verti- 

ESTIMATING MOTION FIELDS AND cal and horizontal components are estimated from the same 

GLOBAL AFFINE MOTION PARAMETERS (N+2S)x(N+2S) target area. 

IN DIGITAL VIDEO SEQUENCES S. Cain and K. Sauer, "Efficient Block Motion Estimation 
onr n ac tot im/cwnnM 5 Usin S ^g™ 1 projections", IEEE Visual Signal Processing 
HELD OF THE INVENTION Workshop, pages 258-263, September 1992 describe a 
This invention relates to the processing of digital video technique for integral projection for block motion estimation 
images, and specifically to estimating local displacement in a video coding system. Their technique includes a multi- 
vectors and global affine parameters between two digital step approach which works as long as displacements are 
images. The quasi -projection matching method of the inven- 10 small. However, Cain et al. do not recognize that there are 
tion may be used in applications such as a) motion com- alternative approaches when the motion in the image is 
pensation in digital video compression system b) motion larger. In addition, they do not consider any extension of 
compensation in digital video conversion systems (temporal their concept to the problem of estimating global transla- 
filtering) c) reconstruction of panoramic images (also called tional or affine motion parameters. 

mosaics) d) motion-based video indexing and retrieval d) 15 I. H. Lee and R. H. Park, "A Fast block matching 

improved image resolution and e) image stabilization. Algorithm Using integral projections", Proceedings of TEN- 
CON 87, Vol. 2, Seoul, Korea, August 25-28, 1987, consider 

BACKGROUND OF THE INVENTION a fast mcthod for doing intc ^ al * ojection in bbck match . 

The current state of the art includes two classes of ing. They propose a multi-step approach with the purpose to 

techniques for estimating motion vectors between two reduce the number of visited pixel locations. Like in the 

images. The first class includes block-matching-based previous case, they do not consider other applications of the 

motion estimation techniques where individual image method. In their approach, the small displacement limitation 

sample values within a source image block are matched to is resolved by the fact that the integral projection they use 

individual image samples within a candidate target image performs averaging over the target block only, 

block. Each candidate target image block is positioned to 2 Consequently, a search is required to estimate each displace- 

correspond to one location in a predefined search window. ment vector as in any conventional block matching algo- 

Block matching-based motion estimation are widely used in rithm. 

today's real-time digital video compression systems. E. Ogura, Y. Ikenaga, Y. Iida, Y Hosoya, M. Takashima, 

Block matching is an important tool used in various 30 K. Yamashita, "A Cost Effective Motion Estimation Proces- 

digital video applications that require local/global corre- sor LSI Using a Simple and Efficient Algorithm", IEEE 

spondences between different parts of one image, or between Transactions on Consumer Electronics, Vol. 41, No. 3, 

two different images. A few of the applications which use August 1995, consider various simplifications of the integral 

block matching in one form or another include compression, projection technique which involve further averaging of the 

tracking, recognition, and video content analysis. A typical 35 column and row vectors. They also consider a general 

full search block matching-based motion compensation sys- architecture which supports these simplifications as well as 

tem requires 2N 2 (N+2S) 2 additions/subtractions per picture half-pel precision motion estimation. As in the two previous 

element (pixel or pel), where N is the horizontal and vertical cases, they do not address other applications, and they use 

size of the source and target blocks over which matching is integral projection technique in conjunction with searching 

performed, and S defines the extent of the area (in pixel 40 to make sure that large displacements may be estimated 

units) extending beyond the boundaries of the non-displaced properly, 
target block. The size of the search domain is therefore a 

(N+2S)x(N+2S) pixel area. The number of operations SUMMARY OF THE INVENTION 

quoted above accounts for N 2 subtractions and N 2 additions A mat u~A ~t *• *• « u ■ i a *• 

7 , i t j i • . . . . . A method of estimating a motion field includes estimating 

to calculate and accumulate distortion values, respectively, A c „ # e 

*u * a- . • u *u / i_ i ; 45 a first motion vector component from a source average 

assuming that distortion is given by the sum of absolute . , ntnnt , a- i • *u u 

• au . i l c j j- * vector and a target average vector; displacing the search area 

pixel value differences (the actual number of additions is ,^ r ^ ir ,„ tn ,uL Jt»- ♦ * a 

KT2-1- c • j i ■ . . . r according to the first motion vector component; and esti- 

N ' for simplicity, it is assumed that the actual number of t - j .■ . r LJ . 

aa *- ■ <t2\ ^L- u . c . £ mating a second motion vector component from the dis- 

addiuons is N ). This number accounts for the number of placed search area 

visited positions in the search domain which is equal to sn 

(N+2S) 2 . Computation may be reduced to some extent by M ob J ect of the mention is to provide an improved 

using a hierarchical approach: technique to the integral projection method. 

Coarse block matching is performed on a small version of Another object of the invention is to provide a method of 

the image, obtained with some form of multi-resolutional estimating global motion parameters in a given motion 

analysis, followed by fine block matching with a much 55 model, such as the affioe motion model or the translational 

smaller search area on the original image. motion model. 

The second class of techniques for estimating motion A ^her object of the invention is to provide a method 

vectors between two images 15 includes integral projection that nas a computational advantage over the block-matching 

techniques for block motion estimation. Integral projection based motion computation technique, 

is a technique in which two-dimensional image array match- 60 Still another object of the invention is to provide a method 

ing is replaced by matching of two one-dimensional vectors of motion computation that improves the accuracy of motion 

obtained by averaging image sample values horizontally and estimation when compared to the integral projection 

vertically, respectively. In the target block, the image block method. 

in the previous image, averaging is performed over the block Another object of the invention is to provide video 

plus the search area around it. In the source block, the image 65 consumer devices, such as digital video camcorders to 

block in the current image, averaging is performed over the rapidly generate motion fields with minimum hardware 

block only. Current integral projection methods suffer from costs. 
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A further object of the invention is to provide applications (N+2S)x(N+2S) target area. The method of the invention 

for motion fields, including motion compensated compres- takes advantage of the fact that in most cases, one of the 
sion of video sequences, velocity-tuned (temporal) filtering components, typically, the vertical displacement component, 

of video sequences and motion analysis of video content for is smaller than the other component, typically, the horizontal 

the purpose of video indexing and retrieval. 5 displacement component. 

Yet another object of the invention is to provide digital The method of the invention improves accuracy of the 

video cameras that include means for estimating global motion estimates by 1) estimating first, the largest motion 

motion parameters from one digital image to another. component; 2) displacing target area according to this 

motion component estimate; and 3) estimating next, the 

BRIEF DESCRIPTION OF THE DRAWINGS 10 smaller motion component from the new target area. The 

FIG. 1 is an illustration depicting the computation of fot ^P lion » thc two blocks l ° be matched have 

averages are computed in a integral projection method. most of their areas m common. This is equivalent to the 

....... — - ... , assumption that the block size is much larger than the search 

FIG. 2 is a block diagram of the quasi-projcctiOD match- area 1q Qrder {Q fifld ^ ver(ical * enl of the motioQ 

ing method of the invention. 15 vfx ^ ^ ^ ^ tfae rQws fc determmed and matched tQ 

FIG. 3 is a block diagram of an alternate embodiment of the resulting vertically oriented one-dimensional vectors, 

the quasi-projection matching method of the invention. This works only because the horizontal displacement is 

FIG. 4 is a block diagram of another embodiment of the assumed to be small. If the horizontal displacement is large, 

quasi-projection matching method of the invention. the resulting matching process will not yield acceptable 

20 results. Put another way, large horizontal displacements lead 

DETAILED DESCRIPTION OF THE t0 dissimilar horizontal projection vectors, which lead to 

PREFERRED EMBODIMENTS unacceptable vertical displacement estimates. The horizon- 

The invention is referred to herein as the quasi-projection ,al component of the motion vector may be similarly cal- 

matching method. The invention offers drastic computa- culated assuming that the vertical component of the motion 

tional advantages compared to block matching-based 25 vector is small. 

motion estimation technique. A typical full search block The first embodiment of the invention provides a detailed 

matching-based motion compensation system requires 2N description of the three step motion estimation procedure 

(N+2S) 2 additions/subtractions per picture element (pixel), mentioned just above. A second embodiment of the inven- 

where N is the horizontal and vertical size of the source and tion describes an extension of the first embodiment whereby 

target blocks over which matching is performed, and S the method is used in an iterative scheme to perform motion 

defines the extent of the area (in pixel units) extending estimation. A third embodiment of the invention describes a 

beyond the boundaries of the non-displaced target block. motion estimation system wherein the method is used to 

The size of the search domain is therefore a (N+2S)x(N+2S) refine the estimates produced by the integral projection 

pixel area. The number of operations quoted above accounts method. A fourth embodiment of the invention describes an 

for N 2 subtractions and N 2 additions to calculate and accu- 35 extension to the first three embodiments, where the method 

mulate distortion values, respectively (assuming that distor- is used in an hierarchical scheme to perform motion esti- 

tion is given by the sum of absolute pixel value differences). mation. A fifth embodiment of the invention is an extension 

This number accounts for the number of visited positions in where the method is used as an initial step in estimating the 

the search domain which is equal to (N+2S) 2 . The method of parameters of a global motion model, 

the invention only requires 8(N+S) 2 additions/subtractions. The quasi-projection matching method of the invention 

This number accounts for N 2 and (N+2S) 2 additions to may be used in applications such as a) motion compensation 

compute row and column averages in the source and the in digital video compression system b) motion compensa- 

target blocks, respectively. tion in digital video conversion systems (temporal filtering) 

The method requires only 2N additions/subtractions in 45 c) reconstruction of panoramic images (also called mosaics) 
order to calculate and accumulate distortion values associ- d) motion-based video indexing and retrieval d) Improved 
ated with each of the (N+2S) possible displacements, and image resolution and e) image stabilization, 
requires that such calculations need to be carried out twice, i n particular, the invention may be implemented in video 
once along the horizontal dimension and once along the consumer devices such as digital video camcorders to rap- 
vertical dimension. 5Q idly generate motion fields with minimum hardware costs. 

Quasi-projection matching is shown to be a better alter- Applications for such a motion field includes motion com- 

native when compared to projection matching in such situ- pensated compression of video sequences, velocity-tuned 

ations where block size and search range are comparable, (temporal) filtering of video sequences and motion analysis 

typically when 3x search range <block size<10x search of video content for the purpose of video indexing and 

range, i.e., 3x<N<10S. If block size<3x search range, neither 55 retrieval. 

method performs well. When the block size is much larger Additionally, the invention may be used in digital video 

than the search area, projection matching approaches the cameras to estimate global motion parameters from one 

performance of the quasi-projection matching method of the digital image to another. Consider a sequence of images 

invention. It will be apparent to those of skill in the art that st0 red in a digital camera where a different part of a larger 

both projection matching and quasi-projection matching 60 scen e has been captured in each image. Global motion 

become more accurate with increases in block size for a parameters are estimated between each consecutive image in 

constant size search range. lne sequence by using the quasi-projection matching tech- 

The invention improves the accuracy of the motion esti- nique of the invention. The resulting global motion param- 

mation process of the integral projection method. More eters may be used to align and compose the images in order 

specifically, current integral projection methods suffer from 65 to reconstruct a panoramic view of the scene, also called a 

limitations in estimating large displacements because verti- mosaic. The global motion parameters may also be used to 

cal and horizontal components are estimated from the same stabilize the video sequences generated by the video camera. 
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Another application for the method of the invention is to 
reconstruct images with improved resolution. In this case, 
both local motion fields and global motion parameters, 
estimated with sub-pixel accuracy, are used to align images 
whose content has been subject to either global or local 
fractional displacements. The images are then merged to 
generate a larger image exhibiting improved resolution. The 
generation of the panoramic images may take place either in 
the digital camera or remotely, as in a PC. 

The concept of the integral projection method (IPM) is 
illustrated in FIG. 1, generally at 10. Consider a current, or 
source, image 12 having an N^N* image block 14, with N v 
vertical rows of pixels and N A horizontal columns of pixels, 
where N is the horizontal or vertical size of the source and 
target blocks over which matching is performed, whose 
upper left corner is located at pixel 16 (q,p) where q is the 
vertical coordinate and p is the horizontal 



*=/> 



(1) 



coordinate. Row averages 
where q^i>q+N v , and column averages 
where p^j^p+N,,, are calculated in the source image. This 
operation yields two one 



*=<? 



(2) 



dimensional vectors, called the source row average vector 
and the column row average vector, with N„ and N h entries, 
respectively. I(kj) and I(ijc) denote image pixel values at 
pixel locations (kj) and (ijc), respectively. S is the extent, or 
range, of search in pixel units, extending beyond the bound- 
aries of the non -displaced target block. 

Using a search area centered about the same N^N A image 
block 22 in a target image 20 and of size N V +2S V , rows and 
N h +2S h columns along the vertical and horizontal 
dimension, respectively, row averages 



(3) 



10 



20 



where q-S v ^i^q+N v -l+S v , and column averages 

k=q+N v -l+S v 



(4) 



N V +2S V 



25 



35 



45 



50 



55 



where p-S^iiP+N^l+S^ are computed. This operation 
yields two one -dimensional vectors, called the target column 
average vector and the target row average vector, with 60 
N V +2S V and N A +2S A entries, respectively. Vertical motion is 
estimated by minimizing a cost function, usually the sum of 
the absolute difference, between the lxN v source row aver- 
age vector and N v consecutive values in the target row 
average vector. Likewise, horizontal motion is estimated by 65 
minimizing a cost function between the lxN/, source column 
average vector with N A consecutive entries in the target 



column average vector. The components of the resulting 
motion vector are the vertical and horizontal offset values (a 
value in the range [-S w +S v ] and [-S A ,+SJ^espectively) 
which were found to minimize the cost functions. Note that 
both the horizontal and vertical motion components are 
estimated simultaneously from the same search area in the 
target image. 
First Embodiment 

The first embodiment of the quasi-projection matching 
method of the invention will now be described. The term 
"quasi" comes from the fact that it involves estimating each 
motion component independently, using a different search 
area, as opposed to the integral projection technique where 
the same search area is used to estimate both motion 
components. Comparisons of the techniques have shown 
that quasi-projection matching method of the invention 
provides better results than integral projection when 
3xS v <N v <10xS v and 3xS A <N,,<10xS /l . 

The first embodiment of the method for estimating a 
motion field includes estimating a first motion vector com- 
ponent from a source average vector and a target average 
vector; displacing the search area according to the first 
motion vector component; and estimating a second motion 
vector component from the displaced search area 

FIG. 2 provides an illustration of the invention first 
embodiment. Compared with the IPM of FIG. 1, only one, 
i.e., the first of the motion vector components (here the 
vertical component) in target image 36, having an image 
block 38, is estimated from the source average vector and 
the target average vectors (here the row average vectors) in 
the first iteration from source image 32, having image block 
34 therein formed of plural pixels. The search area is then 
vertically displaced according to the estimated vertical 
motion. The remaining, second, motion component (here the 
horizontal motion) is then estimated in the second iteration 
in target image 40, having image block 42, once the new 
target average vector (here the column average vectors) has 
been computed. 

Note that the first component to be estimated is pre- 
defined, horizontal or vertical. In digital video sequences, 
vertical displacements are typically much smaller than hori- 
zontal displacements, so it makes more sense in this case to 
estimate horizontal motion first since entries of the source 
and target column average vectors are minimally affected by 
a small vertical motion. 
Second Embodiment 

Alternately, the horizontal and vertical displacements may 
be calculated, compensating those values using the most 
recent horizontal displacement before calculating the verti- 
cal displacement, and vice-versa. The advantage of this 
technique is that better motion vectors are obtained, 
progressively, with each iteration, provided that the initial 
estimates have some validity. The search area may be 
reduced with each iteration, which also reduces the amount 
of computational effort. The horizontal and vertical projec- 
tion vectors needed in the later iterations may be pre- 
computed using a running average in the first iteration. Thus, 
the computational increase amounts to an addition of two 
times the search dimension times the block dimension for 
each iteration. As used herein, an "iteration" is a computa- 
tion of both horizontal and vertical displacements. Because 
the block dimension is typically much larger than the search 
dimension, the amount of computational increase is mini- 
mal. Generally, not more than two iterations are needed in 
most cases to achieve convergence, given that the block size 
is reasonably larger than the search area. 

FIG. 3 provides an illustration of the second embodiment 
of the invention, generally at 50. The beginning image is 
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current, or source, image 52. As in the first embodiment, the interpolating the column and row average vectors by a 

identification of the first component to be estimated is predefined factor. For example, half -pel motion accuracy 

pre-defined (horizontal or vertical). Since vertical displace- may be achieved by interpolating the average vectors by a 

ments are typically much smaller than horizontal factor of 2. The interpolation scheme may be based on a 

displacements, it makes sense again to estimate horizontal 5 parametric filter, i.e., cubic filter, or a conventional low-pass 

motion first. The resulting target image 56 includes an image mter of a predetermined order. In the case of the second and 

block 58, which is the result obtained in the I th iteration, third embodiments, such refinements may be performed at 

while the second target image 60 includes an image block lhe final tw0 ilerati ™s only, to refine both components to 

62, obtained in the (1+1)'* iteration. f b ~^ a ^acy 

This embodiment is a modification of the first embodi- io Fo .!iL th Embodiment 

mem because the procedure described above is repeated a The quasi-pi^crtioa matching technique ot the invention 

, ft - ,f . • i j l may be embedded in a hierarchical scheme. The first 

number or times. Here, vertical and horizontal displacement /** . <?.u . u • u _r j 

ti . . . j . . Y , . embodiment of the technique may be performed on images 

are alternatively estimated using quasi-projection matching which have been spatially decimated by a factor of 2", where 

after re-centering the search area according to the estimate 0 ^ n 0ncc thc motion csiimMs havc becn ^puted, they 

produced by the previous iteration. Consequently horizontal 15 are multiplied by 2, and used to align the same two images, 

and vertical displacements of the search area alternate until which are obtained from the original size images by deci- 

the process is stopped. The size of the search area may be mation by a factor of 2"" 1 . This process may be repeated 

changed from one iteration to another. For example, it is recursively, finally processing images at the same resolution 

possible to reduce the size of the search area as iterations go as that of the orignal images. The hierarchical scheme may 

on since motion estimates get progressively refined. FIG. 3 20 also proceed in steps larger than 1, e.g., a decimation factor 

illustrates the case where the component estimated at the X th of 2 n ~ 2 may be used immediately after the decimation factor 

iteration is the horizontal motion component. The search of 2", however, the motion vectors must be multiplied by a 

area is then displaced and the vertical motion is estimated at factor of 4 in such a situation. Likewise, the second and third 

iteration (1+1) once a new target horizontal average vector embodiments may support a hierarchical scheme. The itera- 

has been computed. 25 tive process of the second and third embodiments may be 

Third Embodiment combined with the hierarchical estimation scheme of the 

In this variation, both the horizontal displacement and the fourth embodiment. The number of iterations performed at 

vertical displacement are calculated without any compensa- each resolution level may be made variable. For example, 

tion in the first iteration, i.e., using simple projection match- one might perform two iterations at each image resolution, 

ing. The displacement which produces the least mismatch is 30 or one iteration at the lowest resolution, and all the other 

used to perform motion compensation before estimating the iterations at the original image resolution, 

other displacement. The rest of the iterative process is The quasi-projection matching method of the invention 

similar to the method used in the second embodiment. may also be embedded in a hierarchical scheme. For 

Although, in the second embodiment, there is an inherent example, the first embodiment may be applied to video 

assumption that the displacements are approximately 35 images which have been spatially decimated by a first 

correct, in the situation where that assumption is not correct, predetermined decimation factor Q n , where Q is a given 

the third embodiment partially remedies that situation at positive integer and n is an integer greater than zero. Once 

only a slight increase in computational burden. the motion estimates have been computed, they are multi- 

FIG, 4 illustrates the third embodiment of the invention, plied by (y, where p is an integer, selected such that l>p<n, 

generally at 70. The third embodiment differs from the first 40 and used to align the same two images but decimated by a 

and second embodiments in that the first component to be second predetermined decimation factor, Q n ~ p . The process 

estimated, i.e., horizontal or vertical, is identified in the may be repeated until quasi-projection matching technique 

initial integral projection step. A source image 12 includes has been used across all scales including the original image 

an image block 74. A first target image 76 includes an image resolution (n=0). 

block 78, obtained after the initial iteration, while a second 45 The second and third embodiments may be integrated in 

target image 80 includes an image block 82, obtained after a hierarchical scheme in the same fashion. For example, it 

a second iteration. might be chosen to perform two iterations at each image 

In this embodiment, the integral projection process is used resolution. The number of iterations may be made variable 

to provide an initial rough estimate of both the initial across the various image scales. In the simplest implemen- 

horizontal and initial vertical displacements, to provide a 50 tation of this hierarchical scheme, only one iteration is 

good initial estimate of the motion vector. The quasi- performed at each scale, 

projection matching method is then used as described above. Fifth Embodiment 

At the beginning of the first iteration, the search area is The fifth embodiment of the method of the invention 

displaced according to the initial motion component which extends the previously described embodiments. Here, the 

produced the best match. The example in FIG, 4 shows an 55 quasi-projection matching method of the invention is used to 

initial motion vector pointing the lower left comer of the estimate parameters of a global motion model. For example, 

image. This vector is the motion estimate produced by the the general formulation of an affine model is x^-Ax^+D, 

integral projection technique in the initialization step. where x t ^ and x, represent the positions of a pixel at time 

Assume that the horizontal component is the one that t+1 and t, respectively, where A represents the transition 

yielded the best match. Therefore, before starting the second 60 matrix and where D represents the global translation vector, 

iteration, the search area is moved to the left according to the The procedure to estimate global affine model parameters 

initial horizontal motion estimate. The quasi-projection (6 motion parameters) from two consecutive images is 

matching method is then used to estimate the vertical motion described below. 

component. As before, the size of the search area may be Step 1: Estimate global translation components by using 

changed between one iteration and another. 65 either the integral projection technique or the Quasi- 

The quasi-projection matching method of the invention Projection technique. The block size is typically large and 

may be designed to produce sub-pel accuracy by simply the search area is relatively small. 
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Step 2: Align the two images according to the global 
translation estimate found in the preceding step. 

Step 3: Subdivide the image into small non-overlapping 
blocks. 

Step 4: Select blocks whose content exhibit large variations. 5 
Various techniques exist for estimating the level of varia- 
tions within a block (standard deviation, sum of the 
absolute value after filtering the block with a high-pass 
filter such as the So be I operator). A block is selected if 
content activity exceeds a pre-defined threshold T 1. 1Q 

Step 5: Given a selected image block, apply quasi-projection 
matching technique (possibly with sub -pixel accuracy) to 
aligned images (see step 2) to produce a motion vector. 
The purpose of this step is to estimate the space-varying 
local residual displacements induced by the transition 
matrix A. Test projection mismatch error and discard 15 
measurement if mismatch is greater than a pre-defined 
threshold T2. Otherwise, add estimated residual transla- 
tion to global displacement and store result. 

Step 6: Count current number of stored results. Repeat step 
5 if this count has not exceeded a pre-defined threshold 20 
value T3, T3>2. (At least three measurements are neces- 
sary to be able to estimate the six parameters of the affine 
motion model). 

Step 7: Estimate global afifine parameters if number of stored 
results is greater than T3. Estimation of the afSne param- 25 
eters (the matrix A and the vector D ) may be performed 
by calculating a Least-Square fit between the model and 
the motion data. 

In particular, the invention may be implemented in video 
consumer devices such as digital video camcorders to rap- 30 
idly generate motion fields with minimum hardware costs. 
Applications for such a motion field includes motion com- 
pensated compression of video sequences, velocity-tuned 
(temporal) filtering of video sequences and motion analysis 
of video content for the purpose of video indexing and 35 
retrieval. 

Additionally, the invention may be used in digital video 
cameras to estimate global motion parameters from one 
digital image to another. Consider a sequence of images 
stored in a digital camera where a different part of a larger 40 
scene has been captured in each image. Global motion 
parameters are estimated between each consecutive image in 
the sequence by using the quasi-projection matching tech- 
nique of the invention. The resulting global motion param- 
eters may be used to align and compose the images in order 45 
to reconstruct a panoramic view of the scene, also called a 
mosaic. The global motion parameters may also be used to 
stabilize the video sequences generated by the video camera. 

Another application for the method of the invention is to 
reconstruct images with improved resolution. In this case, 50 
both local motion fields and global motion parameters, 
estimated with sub-pixel accuracy, are used to align images 
whose content has been subject to either global or local 
fractional displacements. The images are then merged to 
generate a larger image exhibiting improved resolution. The 55 
generation of the panoramic images may take place either in 
the digital camera or remotely, as in a PC. 

Although a preferred embodiment of the invention, and 
several variations thereof have been disclosed, it will be 
appreciated that further variations and modification may be 60 
made thereto without departing from the scope of the 
invention as defined in the appended claims. 

I claim: 

1. A method of estimating motion fields comprising: 
estimating a first motion vector component in a first single 65 

direction from a source average vector and a target 

average vector; 
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displacing the search area according to the first motion 

vector component; and 
estimating a second motion vector component in a second 

single direction at 90° to the first single direction from 

the displaced search area. 

2. The method of claim 1 which, after said displacing, 
includes the step of recentering the search area as a function 
of the previous iteration. 

3. The method of claim 2 which is used in a hierarchical 
scheme, and which includes spatially decimating two video 
images by a first predetermined decimation factor; Q", 
deriving motion estimates; multiplying the derived motion 
estimates by Q?; aligning, as a function of the motion 
estimate multiplied by (F, the original images decimating 
by a second predetermined decimation factor, Q"~ p , where Q 
is a given positive integer, n is an integer greater lhan zero, 
and p is an integer having a value between 1 and n. 

4. The method of claim 2 wherein said estimating a first 
motion vector includes alternating horizontal and vertical 
displacements. 

5. The method of claim 4 which is used in a hierarchical 
scheme, and which includes spatially decimating two video 
images by a first predetermined decimation factor; Q", 
deriving motion estimates; multiplying the derived motion 
estimates by Q^; aligning, as a function of the motion 
estimate multiplied by Q* 7 , the original images decimating 
by a second predetermined decimation factor, Q"~ p , where Q 
is a given positive interger, n is an integer greater than zero, 
and p is an integer having a value between 1 and n. 

6. The method of claim 1 which includes, prior to said 
estimating a first motion vector, using an integral projection 
process to estimate initial vertical and horizontal displace- 
ment. 

7. The method of claim 6 which is used in a hierarchical 
scheme, and which includes spatially decimating two video 
images by a first predetermined decimation factor; Q rt , 
deriving motion estimates; multiplying the derived motion 
estimates by 0^; aligning, as a function of the motion 
estimate multiplied by (y, the original images decimating 
by a second predetermined decimation factor, Q""^ where Q 
is a given positive integer, n is an integer greater than zero, 
and p is an integer having a value between 1 and n. 

8. The method of claim 1 which further includes the steps 
of: 

estimating global translation components by using a tech- 
nique taken from the group of techniques consisting of 
an integral projection technique and quasi-projection 
technique; 

aligning the two images according to the global transla- 
tion estimate of the preceding step; 

subdividing the image into small, non-overlapping 
blocks; 

selecting blocks whose content exhibit large variations; 
applying a quasi-projection matching technique to a 

selected image block to align the images to produce a 

motion vector; 
evaluating a matching error for a selected image block 

using the produced motion vector for the image block; 
counting the number of image blocks having matching 

errors below a predetermined threshold value T2 and 

storing motion vectors for such blocks; 
estimating global aflSne parameters if the number of 

image blocks having matching errors below T2 is 

greater than T3>2; and 
estimating an affine parameter using the motion vectors of 

all image blocks having a matching error below T2. 
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9. The method of claim 1 which is used in a hierarchical 
scheme, and which includes spatially decimating two video 
images by a first predetermined decimation factor; Q", 
deriving motion estimates; multiplying the derived motion 
estimates by Q^; aligning, as a function of the motion 5 
estimate multiplied by cy, the original images decimating 
by a second predetermined decimation factor, Q"~ p , where Q 

is a given positive integer, n is an integer greater than zero, 
and p is an integer having a value between 1 and n. 

10. A method of estimating motion fields comprising: id 
estimating a first motion vector component in a first single 

direction from a source average vector and a target 
average vector; 

displacing the search area according to the first motion 
vector component; 15 

estimating a second motion vector component in a second 
single direction at 90° from the first single direction 
from the displaced search area; and 

spatially decimating two video images by a first prede- 2 n 
termined decimation factor; Q", deriving motion esti- 
mates; multiplying the derived motion estimates by CK; 
aligning, as a function of the motion estimate multi- 
plied by C, the original images decimating by a second 
predetermined decimation factor, (T~ p where Q is a 2 s 
given positive integer, n is an integer greater than zero, 
and p is an integer having a value between 1 and n. 

11. The method of claim 10 which, after said displacing, 
includes the step of recentering the search area as a function 

of the previous iteration. 30 

12. The method of claim 10 wherein said estimating a first 
motion vector includes alternating horizontal and vertical 
displacements. 

13. The method of claim 10 which includes, prior to said 
estimating a first motion vector, using an integral projection 35 
process to estimate initial vertical and horizontal displace- 
ment. 

14. The method of claim 10 which further includes the 
steps of: 

estimating global translation components by using a tech- 40 
nique taken from the group of techniques consisting of 
an integral projection technique and quasi-projection 
technique; 

aligning the two images according to the global transla- 
tion estimate of the preceding step; 45 

subdividing the image into small, non-overlapping 
blocks; 

selecting blocks whose content exhibit large variations; 
applying a quasi-projection matching technique to a 50 

selected image block to align the images to produce a 

motion vector; 

evaluating a matching error for a selected image block 
using the produced motion vector for the image block; 

counting the number of image blocks having matching 55 
errors below a predetermined threshold value T2 and 
storing motion vectors for such blocks; 
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estimating global affine parameters if the number of 
image blocks having matching errors below T2 is 
greater than T3>2; and 

estimating an affine parameter using the motion vectors of 
all image blocks having a matching error below T2. 

15. A method of estimating motion fields comprising: 
estimating a first motion vector component in a first single 

direction from a source average vector and a target 

average vector; 
displacing the search area according to the first motion 

vector component; 
recentering the search area as a function of the previous 

iteration; 

estimating a second motion vector component in a second 
single direction at 90° to the first single direction from 
the displaced search area; 

spatially decimating, in a hierarchical scheme, two video 
images by a first predetermined decimation factor, 
deriving motion estimates; multiplying the derived 
motion estimates by Q^; aligning, as a function of the 
motion estimate multiplied by (F, the original images 
decimating by a second predetermined decimation 
factor, (T~ p 3 where Q is a given positive integer, n is an 
integer greater than zero, and p is an integer having a 
value between 1 and n; and 

estimating global translation components by using a tech- 
nique taken from the group of techniques consisting of 
an integral projection technique and quasi-projection 
technique; 

aligning the two images according to the global transla- 
tion estimate of the preceding step; 

subdividing the image into small, non-overlapping 
blocks; 

selecting blocks whose content exhibit large variations; 
applying a quasi-projection matching technique to a 

selected image block to align the images to produce a 

motion vector; 
evaluating a matching error for a selected image block 

using the produced motion vector for the image block; 
counting the number of image blocks having matching 

errors below a predetermined threshold value T2 and 

storing motion vectors for such blocks; 
estimating global affine parameters if the number of 

image blocks having matching errors below T2 is 

greater than T3; 
estimating an affine parameter using the motion vectors of 

all image blocks having a matching error below T2. 

16. The method of claim 15 wherein said estimating a first 
motion vector includes alternating horizontal and vertical 
displacements. 

17. The method of claim 15 which includes, prior to said 
estimating a first motion vector, using an integral projection 
process to estimate initial vertical and horizontal displace- 
ment. 

* * * * * 
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